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Height-related Gene 




The sex-related height difference in humans is thought to be caused mainly by two 
components: first, a hormonal component determined by the sex dimorphism of bioactive 
gonadal steroids and second, a genetic component attributed to a Y-specific growth gene, 
termed GCY (Tanner, et al. 1966; Smith, et aL 1985; Ogata and Matsuo, 1992). Despite 
extensive mapping attempts for this gene on the human Y chromosome (Ogata, et al. 
1995, Salo, et aL 1995, Rousseaux-Prevost, et aL 1996, De Rosa, et al. 1997), its precise 
position remains unknown. Recent evidence shows that inappropriate cytogenetic 
methodology in the characterization of Y-chromosomal terminal deletions has brought 
about some of the difficulties in elucidating the GC Y-critical region. In order to overcome 
these problems, die inventors have considered only patients presenting de novo interstitial 
deletions for the GCY analysis on the Y chromosome (Kirsch, et al. 2000). This approach 
allows the assignment of GCY to a particular chromosomal interval without excluding the 
presence of XO-mosaicism and/or i(Yp) and idic(Yqll) chromosomes in patients with 
terminal deletions. 

The direct comparison of overlapping interstitial deletions in seven adult males with 
normal height, one male with borderline height, and one patient with a large interstitial 
deletion and short stature resulted in the confirmation of the GCY critical interval between 
markers DYZ3 and DYS11. This region roughly encompasses 1.6-1.7 Mb of genomic 
DNA. To improve the resolution in the region of interest close to the centromere, the 
inventors have established additional new STS markers specific for this part of the 
chromosome using our bacterial artificial chromosome (BAC)/P1 -derived artificial 
chromosome (PAQ contig. Molecular deletion analysis using these new Y-chromosomal 
STSs allowed the inventors to narrow down the critical interval to a genomic region of 700 



Preferably the regions are to the exclusion of the regions of chromosomes on each side of 
the defined regions. 
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Preferably the region is between SKY1 and sY83. It may include one or both the SKY1 
and the sY83 regions. Preferably the region is between SKY8 and sY83 (preferably 
includes one or both of the SKY8 and sY83 regions), or SKY1 and SKY4. 

The invention provides an isolated region of the Y chromosome between DYZ3 and 
DYS11 which encompasses GCY. Preferably the Y chromosome is a human Y 
chromosome. 

The preferred region is between sY79 and sY81, preferably to the exclusion of the region 
of the Y chromosome outside that area of the chromosome. 

Primers for use in GCY studies are also provided 

The invention further provides isolated gene/pseudogene sequences which contributes the 
sex related height difference in humans. These may be one or more of the gene or 
pseudogehe sequences identified in one or more of the figures. 

The invention further encompasses proteins having the same function as GCY protein and 
which have greater than 65% homology, greater than 70% homology, greater than 75% 
homology, greater than 80% homology, greater than 85% homology, preferably greater 
than 90% homology, and most preferably greater than 95% homology to the GCY protein. 
Preferably this has GCY gene activity, for example it has an effect on the height of a male 
mammal when expressed in that mammal. 

Primers for use in detecting or amplifying a region of GCY are also provided. They may 
be labelled using radioactive or non-radioactive labels known in the art and used using well 
known methods. These methods include PCR, Southern or Northern blottipg. 

Experimental evidence will now be described in detail with reference to the figures in 
which: 

Table 1 is a comparison of the adult height of patients and their siblings. 
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Table 2 is a table of new Y chromosomal STSs 

Table 3 is the PCR/restriction digest analysis of sequence family variants in the AZFc 
region 

Table 4 is a summary of BAC and PAC clones identified during physical map creation. 

Table 5 is a summary of the genomic primers that will be used for microdeletion screening 
in adult males with idiopathic short stature. 

Table 6 is a summary of the sequences of the isolated exon trap clones 

Table 7 A is a summary of primer pairs for predicted genes, 

7B is a summary of primer pairs specific for the Y-copy of Adlican (ADLY), 
7C is summary of RT-PCR primer sequences for ADLY, 

Table 8 is RT-PCR primer sequences for exon trap clones, 

Tables 9a & b are tables showing homology of exons between ADLX and ADLY. 

Table 10 is a summary of sequence divergence of genes/pseudogenes from the GCY region 
and their homology. 

Figure 1. Deletion mapping on the long arm of the human Y chromosome. 

A diagram of the human Y chromosome with Yp telomere to the left and Yq telomere to 
the right is presented at the top. Shown below are the results of low-resolution analysis of 
Y-chromosomes of adult males with normal height or short stature. Along the top border, 
95 Y-chromosomal STSs are listed. Except for SKY3 and SKY8 (see Table 2 for detail), 
all other STSs were previously reported (Vollrath et al., 1992, Jones et al., 1994, Reijo et 
aL, 1995). Blank spaces or grey boxes indicate inferred absence or presence of markers for 
which assay was not performed. Asterisks indicate markers in the respective breakpoint 
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regions which could not be tested In all cases where previously published data of the 
patients were re- investigated, the identical DNA sample used for the primary analysis was 
studied (Please note that the proximal as well as the distal breakpoint of the interstitial 
deletion of patient #293 resides within satellite type II sequences.) 

Figure 2. Sequence family variant (SFV) typing in the human DAZ locus in distal 
Yqll.23. 

A. Overview and amplicon structure of the human Y chromosome in the vicinity of the 
human DAZ cluster. Each amplicon is represented by specific bands (A, B, D, E, X). 
Shown above are arrows indicating the orientation of each member of an amplicon family 
with respect to each other. The amplicon indicated by bands X arose from a portion of 
chromosome 1 that was transposed to the distal end of the DAZ cluster and partially 
duplicated. 

B- Precise position of selected Y-specific STSs and the SFVs according to the physical 
map of the human Y chromosome. Marker sY157 is highlighted as it was suspected to be 
present in only one copy by multiplex PCR analysis (see text for detail). 

C. Summary of STS and SFV analysis in patients with Y- chromosomal rearrangements 
within the human DAZ cluster region. Grey boxes indicate inferred absence or presence of 
markers. 

D. Sequence family variant typing of SKY10 and SKY12 in genomic DNA of patient 
#1972. Assay is described in Table 3. Along the right are listed fragment sizes (in bp). 
Products are separated by electrophoresis in 3% NuSieve agarose (3:1) and visualized by 
ethidium bromide staining. 

Figure 3. Schematic representation of the organization of the long arm 

pericentromeric region of the human Y chromosome 

A. Diagram showing the distribution of major tandem repeat blocks and general 
organization of sequence homologies. Basically, the region can be subdivided in three 
distinct intervals: a proximal region characterized by 5bp satellite sequences (G), a central 
region with high homology to chromosome 1 (O), and a distal region composed of 
X/Y-homologous sequences (B). Below the precise position of the newly established and 
previously published STS markers in this region are illustrated. At the bottom border, the 
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PAC/BAC contig constructed with the aid of the new STS markers is shown. Prefixes RP1, 
5 indicate PAC clones and RP11 BAC clones, respectively. 

B. Localization of the GCY critical interval as defined by high-resolution STS mapping in 
patients with short stature and normal height Black boxes indicate the presence, white 
boxes the absence of the respective STS. Striped boxes depict the dosage unknown regions 
where the breakpoint resides. 

Figure 4* Molecular characterization of the GCY critical region a. Schematic illustration of 
the deletions in the two most crucial patients. SKY1 and sY83 demarcate its boundaries 
because clone Y0308 was found to have a different deletion (see Fig. 3) marking SKY1 as 
one of its boundaries. The AZFa region distally adjacent to the GCY region is indicated, b. 
Structural compartmentalization in three segments with distinct homologies. The segment 
composed of 5bp repeats is shown in green, the segment homologous to chromosomal 
subinterval lq43 in orange, and the segment homologous to Xp22 in blue, c Detailed 
description of annotated BAC clones sourcing the genomic sequence of the GCY region. <L 
Precise positioning of PAC clones used as substrates for exon amplification, e. Location of 
all exon trap clones. Due to its small size and limited single-copy content contained within 
exon trap clone etal was not amenable for further experimental analyses / Documentation 
of all in silica generated data sets in subsequent layers: gene models (orientation; 
exon/intron structure) - apparent pseudogenes (exon/intron structure; orientation) - 
promoters. Orientation of gene models can be deduced by colour (red: orientation towards 
the centromere; blue: orientation towards the telomere). Please note that the chromosomal 
region covered by CITB-144J01, CITB-298B15, and CITB-203M13 was already 
intensively studied in Sargent et al. 1999. 

Figure 5. Homology comparisons between genes/pseudogenes of the GCY region and their 
functional progenitors. Precise location of the Y-chiomosomal copies is indicated. Gene 
pair-specific homology and subchromosomal location of the actual structural gene is shown 
in blue. 

Figure 6. Evolutionary history of KIAA1470. On the left, chromosomal movements are 
illustrated. The upper lateral bar shows the exon/intron structure of the functional 
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progenitor in lp36. Successive degenerating events have shaped KIAA1470 into the two 
pseudogenes on lq43 and Yqll. Both copies on lq43 and Yqll share a 98% nucleotide 
sequence homology with each other, the highest among the 12 retroposons of the 
KIAA1470 gene family. They show 77% and 79% homology to the master gene. 

Figure 7. Comparison of the structural features of the X- and Y-specific adlican 
gene/pseudogene. The coding exons of ADLX are illustrated as boxes. Corresponding 
putative exons of ADLY (also presented as boxes) were identified by homology searches. 
Major rearrangements in the putative transcriptional unit of ADLY are highlighted as black 
triangles. Sizes of mRNAs and ORF are presented for ADLX and ADLY. Primers used in 
RT-PCR assays are shown at their respective locations. Identical colouring above and 
below the separation line indicate non-selectivity for both transcripts. RT-PCR primers 
exclusively presented below die line are specific for ADLY. 

Materials and Methods 

Defining the GCY critical region 

Selection of patients 

Patients #293, JOLAR, #28, #63 and #95 have been described clinically in detail elsewhere 
(Skare et al. 1990; Ma et aL 1993; Foresta et al. 1998; Kleiman et aL 1999). Patient Y0308 
corresponds to case 1 in the study of Pryor et al. 1997. Patients TM., #1947 and #1972 are 
phenotypically normal males suffering from idiopathic infertility. Genomic DNA samples 
were extracted from peripheral blood leukocytes (#28, #63, #95, Y0308, T.M., #1947, 
#1972) or from lymphoblastoid cell lines (#293, JOLAR). DNA isolated from peripheral 
blood leukocytes of normal males and females served as internal controls. 

Height assessment 

As all individuals are of diverse ethnic origins, height was compared to the respective 
national height standards (Table 1). Patients were of similar age range. When possible, 
special attention was given to adult height comparisons between parents and siblings. Data 
are summarized along with the height standard deviation score (SDS) in Table 1. To 
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calculate the SDS, mean adult height and the standard deviation were taken from the 
corresponding national physical growth studies. 

PCJR analysis 

Reactions were performed in a total volume of 50ul (75mM Tris/HCl pH9.0, 20mM (NHO2 
S0 4 , 0.1%(w/v) Tween20, 1.5mM MgCl 2 ) containing l.OmM of each oligonucleotide 
primer, lOOng genomic DNA as template, 5 units of Taq DNA polymerase (Eurogentec), 
and each dNTP at lmM in a thermocycler (MJ Research, Inc.) as follows: After an initial 
destination step of 95°C for 5min, samples were subjected to 30 cycles consisting of 
30sec at 94°C, 30sec at 60°C and lmin at 72°C followed by a final extension step of 5min 
at 72°C. The Multiplex PCR was carried out as described in Henegariu et al. 1994 with 
minor modifications. Alu-Alu PCR reactions were essentially carried out as described in 
Nelson et al. 1991. Amplification products smaller than 1 kb were resolved on 3% NuSieve 
agarose/l%SeaKem GTG agarose (FMC) in 1 x TBE (0.089 M Tris-borate/0.089 M boric 
acid720mM EDTA, pH 8.0). For amplification products larger than 1 kb as well as products 
from Alu-Alu-PCR, 1 .5% SeaKem GTG agarose gels in 1 x TBE were used for separation. 

PCR primers 

Y-specific STSs, loci and PCR conditions have been described previously (Vollrath et al. 
1992; Jones et al. 1994; Reijo et al. 1995). Sequences of new Y-chromosomal STSs are 
listed in Table 2. Y-specific STSs termed SKY were either derived from YAC, BAC and 
PAC end sequences or from clone-internal sequences amplified by various combinations of 
Alu primers. Primers for the markers SKY10, 11, 12, and 13 were designed to amplify 
fragments spanning unique restriction sites within the genomic DAZ locus (SKY10 from 
RP11-487K20 (AC024067), RP11-70G12 (AC006983), RP11-141N04 (AC008272), 
RP11-366C06 (AC015973), RP1 1-560118 (AC053522), RP11-175B09 (AL359453), 
SKY11 and SKY12 from RP11-245K04 (AC007965), RP11-100J21 (AC017005), 
RP11-506M09 (AC016752), RP11-589P14 (AC025246) and SKY13 from RP11-100J21 
(AC017005), RP11-589P14 (AC025246), RP11-823D08 (AC073649), RP11-251M08 
(AC010682), RP11-978G18 (AC073893)) in order to detect 'sequence femily variants' 
(SFVs). 
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Restriction a nalysis of PCR products 

PCR products were resolved on agarose gels, the appropriate gel bands cut out and the 
DNA isolated with GFX™ PCR DNA and Gel Band Purification Kit 
(Amersham Pharmacia Biotech, Inc.) according to the manufacturer's protocol. Fragments 
amplified from SKY5 and SKY6 were digested with TaqI and BsmL respectively. To 
detect SFVs at SKY10, SKY11, SKY12 and SKY13, PCR products were digested with 
restriction enzymes as listed in Table 3. 

Se qnencing of BAC/PAC/ V Af? end fragments 

DNA from BAC/PAC clones selected for end sequencing were purified with the 
Nucleobond PC 100 Kit (Macherey-Nagel) according to the manufacturer's instructions. 
End fragments were directly sequenced using the Thermosequenase Fluorescent Labelled 
Primer Cycle Sequencing Kit (Pharmacia) and analyzed on a Pharmacia AX.F. express 
(Amersham Pharmacia Biotech). YAC end fragments were generated with 
Alu/Vector-pofymerase chain reaction and subcloned in pCR2.1 with the TOPO-TA 
cloning Kit (Invitrogen). Sequencing was performed as described. 

Fluorescence in situ hybridization 

Metaphase spreads were obtained either from primary blood samples or immortalized cell 
lines. Preparations were made according to standard protocols (Lichter and Cremer 1992). 
Cosmid and plasmid DNA was labeled by nick translation with biotin-16-dUTP (La 
Roche). Slides carrying metaphase spreads were kept in 70% ethanol at 4°C for one week. 
200-300ng of labeled plasmid or cosmid DNA, 20-30ug of human Cot-1 DNA (GIBCO 
BRL), and hybridization buffer (50% formamide, 10% dextran sulfate, and 2 x SSC, pH 
7.0) were mixed, denatured for 5min at 75°C and pre-annealed for 30min at 37°C. The 
slides were denatured for 2 min in 70% formamide and 2 x SSC, pH7.0, at 72°C (Ried et 
al. 1992). The pre-annealed probe was hybridized overnight in a humidifying chamber at 
37°C. Slides were washed and stained with avidin-conj ugated fluorescein isothiocyanate 
(FITC). The signal was amplified with biotinylated anti-avidin followed by staining with 
avidin-FITC. For the probe all human telomeres (Oncor) the instructions supplied by the 
manufacturer were followed. Chromosomes were counterstained with 
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4',6-dianudino-2-phenylindol dihydrochloride (DAPI). Images were taken separately by 
using a cooled charge coupled device camera system (Photometries, Tucson AZ, USA). A 
Macintosh Quadra 900 was used for camera control and digital image acquisition in the 
TIF format using the software package Nu200 2.0 (Photometries). Separate gray scale 
fluorescence images were recorded for each fluorochrome. Images were overlaid 
electronically and further processed using the Adobe Photoshop software. 

Searching the stature gene 

Microdelp rtQ" spiralling 
Exon amplification 

Shotgun subcloning of PAC clones into pSPUB. Genomic DNA from chromosome Y 
specific PAC clones was partially digested with Sau3AI. lOOng of isolated fragments in the 
range of 4-10Kb were ligated with lOOng of pSPL3B that had been BamHI digested and 
dephosphorylated. The ligation reaction was transformed into supercompetent E.coli XI- 1 
blue cells (Stratagene) and aliquots of each transformation plated on selective medium 
(ampicillin). Resulting colonies were subsequently pooled for plasmid DNA isolation. 

Cell culture and electroporation. COS7 cells were propagated in DME medium 
supplemented with 10% heat inactivated calf serum. For transections COS7 cells in 
between the 5 a and IS* passage were grown to about 75% confluence, trypsinized, 
collected by centrifugation and washed in ice-cold Dulbecco's PBS. 4x1 0 9 cells were then 
resusp ended in cold 0.7ml Dulbecco's PBS and combined in a precooled electroporation 
cuvette (0.4cm chamber, BioRad) with 0.1ml Dulbecco's PBS.containing 15ug DNA. After 
lOmin on ice, cells were gently resuspended, elecfroporated (12kV, 25uf) in a BioRad 
Gene Pulser 2 and placed on' ice again. After lOmin cells were transferred to a tissue 
culture dish (lOOmm) containing 10ml prewarmed, C0 2 preequilibrated culture medium. 
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RNA isolation, RT-PCR and cloning. Cytoplasmic RNA was isolated 72hrs post 
transfection (QIAGEN RNeasy Kit) and first strand synthesis was performed as 
recommended by the manufacturer with minor modifications: 5\ig of RNA was added to a 
solution containing lOmM of each dNTP and 2fjM of ohgonucleotide SA2. The mixture 
was heated to 65°C for 5min and then placed on ice for at least a further minute. After 
adding a reaction mixture containing lOx PCR buffer (Perkin-Elmer Cetus), 25mM MgCl 2 , 
0.1M DTT and RNAsin (35U/n0 , the reverse transcription reaction was transferred to 
42°C for 2min. Ifil of Superscript II RT (200U/fd; Gibco BRL) was them added and the 
reaction incubated at 42°C for 90min and 50°C for 30min. The entire cDNA synthesis 
reaction was thai converted to double strand DNA using a limited number of PCR 
amplification cycles in the following lOOfil reaction mixture: lx PCR buffer (Perkin-Elmer 
Cetus), 1.5mM MgCl 2 , 200nM dNTPs, ljiM SA2, lfiM SD6 and 2.5U Taq polymerase 
(Perkin-Elmer Cetus). 6 amplification cycles were used and consisted of Imin at 94°C, 
lmin at 60°C and 5min at 72°C. To eliminate vector-only and false positive products, SOU 
of BstXI (New England Biolabs) was added directly to the reactions, followed by overnight 
incubation at 55°C. 

lOfil of the digest was then used in a second PCR amplification using internal primers in 
the following 100^1 reaction mixture: lxPCR buffer (Perkin-Elmer Cetus), 1.5mM MgCl 2 , 
200pM dNTPs, l\M (CAU) 4 -SD2, IfiM (CUA) 4 -SA4 and 2.5U Taq polymerase 
(Perkin-Elmer Cetus). 25 amplification cycles were used and consisted of lmin at 94°C, 
lmin at 60°C and 3min at 72°C. Products ware separated by electrophoresis and fragments 
larger than the pure SD2/SA4 RT-PCR product excised and subcloned (CloneAmp pAMPl 
System; Gibco BRL) into pAMPl according to the manufacturer's protocol. Ligation 
reactions were then transformed in ultracompetent E.coli XL-2 blue (Stratagene) and plated 
on selective medium containing X-Gal/IPTG. 

Identification of candidate exons. All white colonies were picked and transferred to 
384-well microliter plates containing selective medium and incubated overnight at 37°C. 
With a 384-pin transfer device 24.5x24.5cm culture plates with and without positively 
charged nylon membranes (Amersham) on top of them were inoculated and also incubated 
overnight at 37°C. Colonies grown on culture plates were pooled for plasrnid preparation, 
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colonies on nylon membranes were used for colony lifts. Plasmid inserts were excised, 
purified, and hybridized to nylon membranes containing EcoRI-digests of the PAC clones 
used as the original substrate. Highlighting bands were subsequently isolated and 
hybridized to colony lifts to identify candidate exons. Candidate exons were isolated and 
sequenced by Sequitherm EXCEL II DNA Sequencing Kit (Epicentre Technologies). 
Sequences were automatically analyzed and read on an ALFExpress DNA sequencer. Table 
6 lists the sequences of the isolated exon trap clones. 

Exon Trapping. DNA from chromosome Y specific PAC (PI -derived artificial 
chromosome) clones RP1-148J07, RP5-1160A12, RP1-301P22, RP4-532I07 and 
RP1-114A11 was partially digested with Sau3AI and fragments in the range of 4-1 0Kb 
were individually subcloned into pSPL3B. COS7 cells were transfected and after 72hrs 
cytoplasmic RNA was harvested using QIAGEN RNeasy Kit cDNA synthesis was 
performed as recommended by die manufacturer (Gibco-BRL). Primers flanking the 
cloning sites were used to identify products larger than the pure SD2/SA4 RT-PCR 
product These fragments were excised, subcloned (CloneAmp pAMPl System; Gibco 
BRL) into pAMPl and sequenced. Exon trap clones were labelled with 32 P-dCTP by 
random priming and used as hybridization probes on Southern blots. Hybridization: 16 hrs 
at 65°C in standard hybridization buffer (Singh and Jones 1984). Wash: three times for 20 
min each at 65°C in O.lxSSC, 0.1% SDS. 

In silico gene prediction. Completed genomic sequences from BAC clones RP11-75F05, 
RP11-461H06, RP11-333E09, RP11-558M10, CriB-298B15 and CrTB-144J01 were 
analyzed for homologies to known genes and virtual gene content using the MX 
(httoV/menu.hgmp.mrc.ac.uk) and Rummage (http^/genl00.imb-jena.de) software 
packages. Computational identification of promoters and first exons was achieved by 
submitting BAC sequences to FirstEF (http://www.cshl.org/mzhanglab). 

Reverse-transcribed pofyA + -RNAs and cDNA libraries. Human polyA + -RNA of 16 fetal 
and adult tissues was purchased either from Clontech or Invitrogen. Human polyA + -RNAs 
from 3 osteosarcoma and 1 bone marrow fibroblast cell line were isolated by the QIAGEN 
Oligotex kit. First-strand cDNA synthesis was essentially carried out as described (Rao et 
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al. 1997). Fourteen cDNA libraries were obtained either from Clontech or Stratagene. A 
collection of 40 cDNA libraries was also provided by the Resource Center of die German 
Human Genome Project (RZPD). The complete list is available on request 

Characterization of potential transcription units. After homology comparison and open 
reading frame (ORF) analysis of exon trap clones, primers were designed for RT-PCR 
amplification. Sequences are summarized in Table 8. In those cases where exon trap clones 
consisted of only one exon, two exon-specific primers were combined with cDNA-library 
specific primers in semi-nested PCR. Primers were designed from predicted gene models to 
amplify across exon/intron boundaries. To provide evidence of transcription, primers were 
used to screen a panel of cDNA libraries and poIyA + -RNAs (see above). In the case of 
potential co amplification from homologous transcripts, primers flanked Y-specific 
restriction sites. 

Evolutionary strata classification. Sequence divergence between genes/pseudogenes of the 
GCY region and their functional/non-functional progenitors was determined according to 
Li, 1993. Sequences for all pseudogenes were extracted from genomic sequences: 
KIAA1470PY from BAC clone RP11-75F05 (AC011293), KIAA1470P1 from BAC clone 
RP11-498M14 (AL445675), ADLY from BAC clone RP11-333E09 (AC011302), ARSFP 
and RPS24P1 from BAC clone CITB-144J01 (AC004772), RPS24PX from BAC clone 
RP11-418N20 (AC1 19620), ASSP6 from BAC clone RP11-461H06 (AC012502) and 
ASSP4 from BAC clone GS1-536K07 (AC004616). Sequences for all other genes were 
obtained from published cDNAs, whose GenBank accesion numbers are as follows: ADLX 
(AF245505), ARSF (XM_035467), RPS24 (NM 033022), ASS (X01630), KIAA1470 
(AB040903). THC604695PY was not analyzed as only part of its most terminal exon 
(consisting almost entirely of 3'UTR) was available for comparison with the 
X-chromosomal EST cluster (AA662182 and AA662138). 
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Results 

Ma pping of interstitial deletions 

We studied the DNA of nine adult males which originally consulted reproduction centers 
about idiopathic infertility, but were otherwise generally healthy. Of the 9 males, 7 were 
unremarkable with respect to adult height One patient, #293, with a height of 157cm, 
presented short stature (SDS -2.9) and one, Y0308, with a height of 165.5cm showed 
borderline height, being at the 3 rd percentile of normal U.S. height standard (SDS -1.7). 
Adult height of his parents and siblings are in the normal range (Table 1), his brother being 
20.5cm taller than the patient Compared to his target height (178cm) and target range 
(169-1 87cm) he can be considered short All men were ascertained solely on the basis of 
the occurrence of large de novo interstitial deletions on the Y chromosome. Only two of 
those patients had undergone previous chromosomal studies. 

In our effort to localize the GCY locus, we focused on that part of the Y chromosome long 
ann, which was delimited by the boundaries of the interstitial deletions of the patients with 
short stature (Fig. 1). Recently, a detailed physical map of the human Y chromosome 
incorporating 758 ordered STSs and 199 completely sequenced BAC clones has been 
constructed (Tilford et al. 2001). We used a slightly modified PCR multiplex system 
(Henegariu et al. 1994) to test the absence or presence of 28 DNA loci from the Y 
chromosome long arm. In patients where sufficient DNA was available for further PCR 
analysis additional STSs were tested As a result, 8 of 9 interstitial deletion breakpoints 
could be positioned (Fig. 1). As the deletions of patients JOLAR, #28, #63, #95, TM., and 
#1947, all with normal height, overlap, most of the long arm of the Y chromosome could 
be excluded as a critical region for GCY. 

As the distal breakpoint of the deletion of patient #1972 does not reside within the specific 
part of the Y chromosome long arm, the nature of the deletion (terminal or interstitial) 
remained unclear. There was also no overlap of his deletion with the deletions of patients 
#1947 and T.M. Relying solely on the results obtained by the STS-based interstitial 
deletion mapping strategy, one could not formally exclude the region distal to sY158 as a 
potential critical region for GCY. However, multiplex PCR analysis always showed a less 
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intense amplification product for STS sY157 (a Y-derived marker in close vicinity of 
sY158). To address mis problem, the rearranged Y chromosome of patient #1972 was 
investigated in more detail. 

Fluorescence in situ hybridization and sear™ "™ family variant typing of patient 
#1972 

The overall integrity of the Y chromosome from patient #1972 was demonstrated by FISH 
of the cosmids LLOYNC03"M"34F05 (PARI) and LLOYNC03"M"49B02 (PAR2) as well 
as the Y-centromere-specific probe Y-97 and the telomere-specific probe 'all human 
telomeres' (data not shown). Being aware of the complex structural organization of the 
human DAZ locus (Fig. 2A), we specifically searched for sequence family variants (SFVs). 
To prevent misjudging sequence errors as single nucleotide differences, 
PCR/restriction-digestion assays were developed only from SFVs present in at least two 
overlapping BAC clones. The localization of these SFVs is shown in Fig. 2B. As these 
SFVs could represent allelic variants, ten unrelated normal German males were typed. In 
all cases, the expected fragment pattern could be detected for the Y-chromosome derived 
sequences. In contrast, the fragment pattern deduced from the genomic sequence of the 
chromosome 1-derived BAC clone RP1 1-560118 could not be confirmed (see Table 3 for 
detail). Each SFV-specific PCR/restriction digestion was compared to the presence/absence 
in the corresponding BAC clones. 

Typing the genomic DNA of patient #1972 for all four sequence family variants 
(SKY10/Tsp5091, SKYll/NlaHI, SKY12/MseI, and SKY13/Cac8I + Tfil) revealed the 
absence of one Y-derived non-allelic sequence variant (Table 3 and Fig. 2C,D). In the case 
of SKY10 the distal copy is deleted. Not surprisingly, in all other typing experiments the 
more proximal copy of the respective SFVs was shown to be deleted. 

Next, we investigated these SFVs in the two patients with the most distal breakpoints (#95 
and #1947). Using genomic DNAs, we determined that both non-allelic variants of SKY1 1, 
SKY12, and SKY13 and one non-allelic variant of SKY10 were absent in patient #1947, 
whereas for all tested SFVs one non-allelic variant was absent in patient #95. 
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Taken together, these results provide evidence that the proximal breakpoint of the 
interstitial deletion present in the Y chromosome of patient #1972 resides within the 
interstitial deletion of patient #1947, thereby excluding this genomic region as a potential 
critical interval for GCY. 



Refinement of the GCY critical interval 

Based on the molecular analysis of the pericentric region of the long arm of the human Y 
chromosome (Williams and Tyler-Smith 1997), the physical extension of the GCY critical 
region as defined by the markers sY78 (DYZ3) and sY83 (DYS11) was estimated to 
constitute 1.6-1.7 Mb (Fig. 3A) of DNA. The most proximal 400 kb of this region consist 
exclusively of 5bp satellite sequences separated from the Y centromere only by Alu 
sequences. This constant part of the human Y chromosome is therefore unlikely to contain 
coding sequences. The remainder of the GCY critical region is composed of 
X/Y-homologous as well as autosomal/Y -homologous sequence blocks. At the onset of this 
study, only limited coverage in YAC clones was aivailable for this region. In order to refine 
the GCY critical interval and to generate gene finding substrates, it was necessary to 
establish a BAC/PAC-contig of this region. 

We generated 25 additional markers mainly by sequencing the end fragments of BAC, 
PAC, and YAC clones as well as clone-internal sequences amplified by various 
combinations of Alu-Alu oligonucleotide primer pairs. Of those, only 7 turned out to be 
Y-specific (SKY1, SKY2, and SKY4-8) (see Table 2 for detail). The BAC and PAC clones 
identified during the generation of the physical map are summarized in Table 4. 
Meanwhile, some of these clones have been completely sequenced as they form part of a 
tiling path for sequencing the human Y chromosome (Tilford et al. 2001). The proximal 
part of the cloned region between markers sY78 and SKY6 has not been sequenced to date. 
A selection of clones covering the entire GCY critical region is depicted in Fig. 3. 

Confirming the overlap between BAC RP11-295P22 and BAC RP1 1-322K23 appeared to 
be the most crucial step in the process of contig construction. Y-specific markers derived 
from the opposite end fragments of both clones wore suspected to amplify identical-sized 
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fragments from two different loci within the same 5bp satellite region. By testing several 
restriction enzymes known to cut frequently within 5bp satellites composed of the 
consensus sequence (TGGAAX we developed loci-specific PCR/restriction digestion 
assays. Typing all BAC clones mapping to this sequence block with the appropriate 
PCRAestriction digestion assay allowed us to precisely position them thereby c onfirming 
their overlaps. 

In order to narrow down the critical interval for the GCY gene, we tested for the presence 
of the newly generated STS in patients #293, Y0308, and JOLAR. These results allowed us 
to define a small region for the GCY gene (Fig. 3 and Fig. 4). Direct sequence comparison 
showed that the sequenced BAC clones RP11-322K23, RP11-75F05, RP11-461H06, 
RP11-333E09, RP11-558M10, CITB-298B15, and CITB-203M13 completely cover the 
mapped region between Y-STSs SKY8 and sY83 (DYS11), suggesting that it encompasses 
roughly 700 kb. Basically, the region can be subdivided in three distinct intervals: a 
proximal region characterized by 5bp repeats, a central region with high homology to 
chromosome 1, and a distal region composed of X/Y-homologous sequences. As the most 
distal part of the GCY critical region (beginning with bpl of BAC clone CITB-144J01) 
was already subject of extensive research during the process of characterization of the 
AZFa critical region and was shown to harbour no functional gene (Sargent, et al. 1999), it 
was excluded from further detailed genomic DNA analysis. The most proximal part of the 
GCY critical region consists exclusively of satellite type 3 sequences of the 5bp consensus 
(TGGAA) n and is therefore also not assumed to contain any gene. Leaving these two 
regions out of consideration, we were able to concentrate our efforts to a smaller interval of 
420 kb of DNA Large-scale sequence comparisons performed by the Advanced PipMaker 
software showed no integration of Y-specific sequences into the chromosome 1 and/or 
chromosome X-homologous regions. 

We have also established new Y-specific markers scattered uniformly across the entire 
420Kb of DNA (Tab 5). 
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Exon trapping in the GCY critical region. 

The boundaries of GCY region are defined by two deletion patients, JOLAR and Y0308 
(Fig. 3). PAC clone, RP1-148J07, extends into a genomic segment exclusively composed 
of 5bp repeats of the satellite 3 type. The very distal PAC clone, RP1-83D22, was not 
included in the experimental analysis, as the region distal to sY82 was previously analyzed 
in the course of defining the transcriptional potential of the AZFa region (Sargent et al. 
1999). To identify transcripts that might encode GCY, we used 5 PAC clones from the 
GCY region as substrates for exon trapping (RP1-148J07 up to RP1-1 14A1 1, Fig. 4). Each 
of the 5 PAC clones from the GCY region was individually subcloned and subjected to 
exon trapping. Nucleotide sequencing of trapped products identified 9 different exon trap 
clones, two of them were composed of two exons (Fig. 4, Tab. 6). All exon trap clones 
were isolated in several copies. Exon/intron boundaries of all 11 putative exons matches 
the splice site consensus. Trapped products that mapped to the GCY region were verified 
using PCR by their presence versus absence in males and females and GCY-deleted males 
with short stature. All exon trap clones revealed only one male-specific fragment on 
Southern blots. 

In silico analysis of annotated BAC clones. 

We analysed the genomic sequence of the complete GCY region using the gene prediction 
programs assembled by the NIX and Rummage software packages. Homologous sequences 
were also analysed in the non-redundant (nr) database of GenBank using the BLASTN or 
FASTA algorithm. BAC RP1 1-75F05, for example, includes a 1Kb segment with a 77% 
homology to the transcriptional unit KIAA1470 on chromosome lp36 (Fig. 5). On BAC 
RP11-461H06 and CITB-144J01, for example, sequences of 2.5 and 1Kb length showed a 
88% and 81% homology with the genes ASS and RPS24 on chromosome 9q34 and 10q22, 
respectively. The Y-chromosomal copies ASSP6 and RPS24P1, however, represent 
pseudogenes and have a progenitor on Xp22 that has been translocated to the Y 
chromosome. Two pseudogenes on RP11-333E09 and CITB-144J01, THC604695PY and 
ARSFP, represent deleted copies of Xp22 specific genes. 

BAC RP11-333E09 includes a deleted diylication (ADLY) of the adlican gene on 
chromosome Xp22 (ADLX). ADLX has been previously shown to be upregulated in 
osteoarthritic tissue and therefore likely plays a role in bone metabolism. The Y 
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chromosome copy, therefore, constitutes an important candidate for a gene involved in 
growth. Despite the loss of exons 3 and 4 as a consequence of intrachromosomal 
recombination, its basic structural organization (Fig. 7) and sequence homology to ADLX 
(Tab. 9a) could still allow to encode a functional protein with similar molecular properties. 
This observation was enforced by a unified predicted gene model of ADLY by all 
gene-finding programs (cfl; Fig. 4). Taking the functionality of the predicted ADLY 
promoter for granted and assuming ADLY would start at the ATG codon also used on the 
X chromosome, an in-fiame stop codon at position +359 would result in premature 
tennination. One additional promotor was predicted in the sense strand of the last intron of 
ADLY. There is, however, no obvious correlation between the promoter position and the 
significance for potential ADLY expression. 

Using various gene-finding programs we detected 17 gene models in the GCY region (Fig. 
4f). Only five (arl, cfl, crl) overlapped with transcriptional units identified by homology 
search. Conceptual translations of 14 models revealed no protein matches. With respect to 
location and orientation promoters predicted by FirstEF could be assigned to KIAA1470P, 
ADLY, RPS24P1, and ARSFP. 

In conclusion, there is no identity of exon trap clones and gene models/homologies or 
pseudogenes KIAA1470PY, ASSP6, and THC604695PY. Considering ADLY as the most 
attractive candidate for the GCY locus, we directly compared the exon/intron boundaries of 
the Y- and X-derived copy (Tab. 9b). Exons 3 and 4 of ADLX are deleted on the Y copy. 
The remaining 3 internal exons still possess correct 5 C and 3* splice sites. 

Searching for a transcriptional unit 

Homology searches performed with all exon trap clones and predicted gene models against 
the dbEST segment of GenBank did not yield any Y-specific EST. PCR and 
PCR/restriction digestion assays with primers corresponding to all putative transcriptional 
units were carried out. Primers derived from all exons of ADLY (Tab. 7B, 7C), the most 
prominent GCY candidate, were used to screen reverse-transcribed polyA + -KNAs from 
osteosarcoma and bone marrow fibroblast cell lines. Whereas ADLX was shown to be 
expressed in all tested cell lines (with the exception of neuronal tissues), no ADLY specific 
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specific transcript was detectable. More extensive screening of polyA + -KNAs from various 
adult and fetal tissues basically led to the same result We also tested all putative 
transcriptional units in the GCY region for expression in pofyA + -RNAs from 21 tissues and 
49 cDNA libraries. RT-PCR assays did not provide proof of a transcribed gene. 

Evolutionary features of the GCY critical region. 

High sequence homology of the Y chromosome to other chromosomal regions is consistent 
with an evolutionarily recent transposition of those regions to the Y chromosome. More 
subtle nuances in synonymous nucleotide divergences of homologous gene pairs QQ allow 
their integration into distinct evolutionary strata, group 1-4 (Lahn and Page 1999). The 
calculated K, values for all gene pairs in the GCY region along with K, values from 
reference genes of the different stratas are given in table 6. We noted that the K. values for 
all X-Y gene pairs can be grouped into the most recent evolutionary stratum (group 4), 
having been embarked on X-Y differentiation 30 to 50 million years ago. This 
classification is independent of the actual functional state of X-chromosomal genes. 
Comparing K, values between the Y-copies in the GCY region and their functional 
progenitors clearly demonstrates that decay of the X-chromosomal copies took place before 
the X-Y recombination occurred. Even more prominent is the difference between K« values 
for the chromosome 1 - chromosome Y gene pairs. The low K, value for the 
KIAA1470P1/KIAA1470PY gene pair points towards a very recent transposition to the 
human Y (Fig. 5). Supporting evidence comes from fluorescent in situ hybridization in 
primates delineating this event to a time period of about 5 to 6 million years ago (Wimmer 
et al. 2002). The K. value for the comparison of KIAA1470PY with its functional 
progenitor in lp36 date the underlying intrachiomosomal transposition roughly to about 
150-170 million years ago. 

As the frequency of nonsynonymous substitutions (K*) is a function of both evolutionary 
time and selective constraints on the encoded proteins, the degree of constraint can be 
reflected in the ratio Ks/Ka (Li, 1993): Values greater than one indicate the presence of 
constraints on both homologs, and values in the vicinity of one are consistent with lack of 
constraint on at least one homolog. All determined K/Ka ratios suggest that natural 
selection on the Y copies is not ongoing thereby underlining their pseudogene status. 
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We searched the nr database of Genbank with the homology transitions and the distal 
border of the GCY region to precisely determine the physical extent of the homologous 
regions on chromosomal subintervals lq43 and Xp22. To identify highly conserved 
segments, we used Advanced PipMaker (Schwartz et al. 2000, ht£://bio.ces.psu.edu) for 
comparing the corresponding DNA. Inspection of the compound dot plot allows the 
identification of those portions of the GCY region absent in homologous sequences. As the 
overall homology of Y/l and Y/X in conserved regions is already in the range of 94-97% 
and 96-99%, putative protein-coding exons are not expected to show average percent 
identities higher than the non-coding environment Careful dot plot analysis showed that all 
novel sequences that have accumulated in the GCY region on the Y after the separation 
from its autosomal or X-chromosomal counterpart are exclusively of repetitive origin. 
Particularly evident is the prevailing preponderance of integrated UNEs family members. 

Discussion 

Since the issue on the existence of a Y-specific growth gene (GCY) was first raised, there 
have been several attempts to define its precise location. Whereas initial studies 
unanimously pointed towards a common region of the Y chromosome long arm (Salo et al. 
1995), more recent investigations have led to the identification of two non-overlapping 
critical intervals (Rousseaux-Prevost et al. 1996, Ogata et al. 1995, De Rosa et al. 1997). 
FISH analyses resolved this apparent contradiction by presenting clear evidence that the 
patient materials used in these. initial investigations contained 45,X0 cells and/or i(Yp) or 
idic (Y ql 1) chromosomes (Kirsch et al. 2000). Both genetic parameters influence the adult 
height of a given individual, thereby rendering it impossible to predict whether such 
patients have lost GCY or not Studies with patients carrying de novo interstitial deletions 
are, therefore, much better suited to address the problem of GCY localization. 

In the course of winnowing the literature for patients with small interstitial deletions, in 
particular close to the centromere, it became clear that those patients are very rare. This 
prompted us to extend our search for patients carrying large de novo interstitial deletions, 
irrespective of their actual adult height. We examined 9 adult patients, 7 of whom 
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presented normal height Futhennore, we could show overlapping deletions, thereby 
excluding GCY to reside between the Y-specific marker DYS1 1 and the pseudoautosomal 
region 2 (PAR2). Two patients, #293 and Y0308, presented interstitial deletions enabling 
the restriction of the GCY critical region to approximately 700 kb of DNA. This region is 
therefore predicted to harbour one or more genes required for normal human growth. 

Exon amplification and gene modeling in the GCY region. 

Although much attention has been drawn to the various azoospermia (AZF) critical regions 
in Yqll as well as Y-encoded testis-specific or ubiquitously expressed genes, the GCY 
region up to now was not searched systematically for transcription units. We have used 
exon amplification, homology search, and in silico gene prediction to identify putative 
genes within this region. This information now provides the means to test candidate genes 
for involvement in human linear growth regulation. Up to date, the major problem in 
defining the GCY gene was the lack of potential transcription units assigned to this portion 
of the human Y chromosome. Prior to this study, there were only two pseudogenes, 
RPS24P1 and ARSFP, that mapped to the GCY critical region (Sargent et al. 1999). 

By exon amplification we isolated 9 different exon trap clones, two of which were 
composed of two exons. Parallel sequencing efforts of foe GCY region by the Human 
Genome Project allowed us to complete our catalog of potential transcription units in the 
GCY region. No Y-specific ESTs were assigned to the region. The Nix and Rummage 
software programs were used to analyze sequence data of completed BACs to predict 
potential genes in the sequence. We have identified 4 new genes/pseudogenes and 17 gene 
models. Of the 17 gene models, only five have homologies to the identified 
genes/pseudogenes. A gene model homologous to ADLY (cfl) was uniformly predicted by 
all gene-finding programs. Though, the probability given by various gene finding programs 
might be overestimated with regard to the gene model cfl . Very large exons, as present in 
ADLY, are less likely to be predicted correctly, but they are most unlikely to be completely 
missed. Consequently the tendency to classify actual pseudogenes as functional genes 
increases with the presence of large exons. The failure to trap exons of the putative ADLY 
transcription unit, albeit possessing correct splice sites, might be an intrinsic feature of 
Y-chromosomal sequences. Complete representation of the AZFc region in cosmid/Pl 
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clones used for exon-trapping experiments (Reijo et al. 1995) led to the detection of DAZ 
as the only gene out of a possible 8 genes/gene families located in this region 
(Kuroda-Kawaguchi et aL 2001). 

Surprisingly, we observed no concordance between the gene models and the exon trap 
clones. It is possible that exon amplification is dependent on the presence of functional 
splice sites in the genomic sequence whereas gene modeling is mainly based upon the 
in-phase hexamer measure (Rogic et al. 2001), a method determining the incidence of 
oligonucleotides of length six in a specific open reading frame. On the other hand, the 
prediction of correct splice sites is less important since such signal sensors have low 
information content and are usually degenerate. Consequently, the exon tnq? clones need 
not to be necessarily part of one of the predicted gene models, although a substantial 
fraction of the trapped exons (7/11) are composed of 75 to 200 nucleotides, a length range 
in which exons are most accurately predicted. Likewise, the putative exons assembled to a 
distinct gene model do not necessarily represent real exons. 

It is possible that the eventual number of genes in the GCY region is smaller since exon 
trap clones and/or gene models turn out to be part of the same transcripts or do not 
represent genes at all. Despite the number of potential transcription units in the region, 
however, the search for the critical one might still be complicated by the fact that the 
phenotypic effect caused by mutation of the GCY locus is hard to be defined precisely. 
This makes it difficult to predict an expression profile, especially when the gene function is 
unknown. Since human linear growth is a multifactorial trait, growth failure is quite 
common.. Although at least nine growth-controlling genes have been identified up to now, 
only few cases present disease-causing mutations within those genes. Definition of the 
transcription units in the region should now facilitate mutation studies, especially since 
full-length genes/pseudogenes have been isolated. . 

Although reverse-transcribed polyA + -RNAs and cDNA libraries have been extensively 
screened, we have not detected any transcript specific to the Y. This raises the question 
whether our approach was suitable. To assess its usefulness we have verified the 
expression pattern of 20 genes known to be essential for bone development at GenePage 
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(http://genome-www5 .stanfbrd.edu). At least double presence for each selected gene was 
warranted by our screening efforts. This corroborates the existence of an unusual gene with 
an extremely confined spatial and/or temporal expression pattern. 

Evolutionary features as a clue to the GCY locus? 

To gain more insight into the molecular genesis of the GCY critical region, we used two 
methods. First, we validated the functional state of the genes/pseudogenes within the GCY 
region by comparing them with their direct and functional progenitors. All gene pairs 
showed Ks/K. ratios of 1 to 2 rather indicating that the Y copy is a pseudogene. This result 
assigns the X-Y gene pairs to evolutionary stratum 4 which fits very well since all those 
gene pairs share a common evolutionary history. Only one gene pair out of this class, 
AMELX/Y, still encodes a functional X- and Y-copy (Salido et aL 1992). The Y-copy of 
KIAA1470 clearly could be classified as a pseudogene by comparing it with its functional 
progenitor on lp36. Second, we made use of large-scale sequence comparison in order to 
identify potential differences between the subintervals of the GCY region and then- 
homologous counterparts in Xp22 and lq43. Neither subregions with a conservation level 
above the molecular environment nor small genomic fragments newly integrated into the 
GCY critical region could be detected. Furthermore, promoter prediction carried out 
simultaneously on homologous genomic sequences revealed no differences. This clearly 
excludes substantial rearrangements within the GCY critical region and lends support to a 
gene underlying male-specific regulatory mechanisms. 
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Table 1 Adult height comparison of patients and their siblings 



Case 


Country 
of origin 


Height of patient (cm) 
and standard deviation 
score 


National 
height . 

OUUIIltM la 

(cm) 


Heights of family 
members (cm) and 

era Tl no Vim it 0t7l q hnn 

score 


#293 


U.S.A. 


157(SDS-2.9) 
short 


176.9 
(SD6.8) 


(P)170 
(M) normal 
(B) normal 


Y0308 


U.S.A. 


165.5 (SDS -1.7) 

hondcTitufi 

VUiWwlllllV 

(short?) 


176.9 
(SD 6.8) 


(F)170 

(M) 168 ! 
(B) 188 (SDS +1.7) 
(S) 170 (SDS -0.4) 


JOLAR 


United 

tt m Off i\tt% 


168 (SDS -1.0) 

firtrmsil 
muuicu 


Y1A.1 


(F) normal 
rivn normal 
(B) normal 


#28 


Italy 


175 (SDS -0.3) 
normal 


176.1 
(SD 65) 


(F) normal 
(M) normal 


#63 


Ethiopia 


170 (SDS 40 J) 
normal 


168.0 
(SD7.4) 


(F) normal 
(M) normal 


#95 


Israel 


185 (SDS +1.4) 
normal 


175.6 
(SD6.8) 


(F) normal 
(M) normal 


T.M. 


Belgium 


182 (SDS +13) 
normal 


173.5 
(SD6.7) 


(F) normal 
(M) normal 


#1947 


Germany 


175 (SDS -0.8) 
normal 


179.9 
(SD6.4) 


(F) normal 
(M) normal 


#1972 


Germany 


181 (SDS +0.2) 
normal 


179.9 
(SD6.4) 


175(F) 
165CM) 

172 (S) (SDS +1.0) 



. The standard deviation score (SDS) was calculated based on die equation: SDS = (X-MJ/SD, where X is an 
individual's adult height and M and SD are die mem adult height and die ± 1 standard deviation of die normal 
population, respectively. ' ■ 

(M)molher t (F) Either, (S) sister, (B)brother, (NA)n<rt available. 
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Table 2 Y-chromosomal STSs 
Left Primer RJgjit Primer Product 



SKYl 


GGACATTTGGCTGCAGAGAT 


TGGCAATGCACrCTCATCAT 


255 


SKY2 


TCAGGACAGACAOGCTGCTA 


CCTGCCACTGAGCTCCTTAC 


-1700 


SKY3 


TTCTCCCTCATCTTCCAAGC 


GCTTCCATCCATTAGCAAGG 


167 


SKY4 


(XnTTCATTCCATTCrCTTCCA 


CGCACTTTATGGACTGCAA 


111 


SKY5* 


CCCTCXJKXATriUriTlXiA 


CCTCGAATTTAATGGATTGC 


202 


SKY6* 


TCAATGGATGCACAGTGTGGC 


TOCACTGAATTCCATTGCAC 


328 


SKY7 


GGGAGTGCAAAGGGAAAGAT 


CirrCCATGGGGTGACATTC 


223 


SKY8 


CCATTCATTOGAGTTCATTAOO 


ATTGGAATGGAATCGGACAG 


189 


SKY9 


GGCCGATGGTCAAACTGTTA 


GAAACXKjGCTCTGAAATTCT 


531 


SKY10* 


ATAAGGGGCAGGTTTGTCAC 


GCTACTTAITCAGTGTTTAACTGACAC 


329 


SKY11* 


AAAGTGGGTGAAGGACATGG 


TTTTTGTTTGTGGCAGGTG 


469 


SKY12* 


TTGAGTCACTGGGGATAACTG 


TATGGCCCACAATCACTTCA 


216 


SKY13* 


GGCAGCCTAGAAAGTCTTGTTC 


CCCTTGGGATnTGTCTGTT 


198 



Markers indicated witii a * amplify DNA fragments from more than one genomic locos (sec Chapter Restriction analysis o/PCR 
products for detail). 
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Table 4 Summary ofBAC^and PAC clones identified daring physical map creation 



Y-STSs 



Positive BACsfRPCI 11) 



Positive PACs (RPCI1, 3-5) 



sY83 
sY82 
OY8 

sY81 

14A3C* 

sY79 

SKY1 
SKY2 



SKY4 
SKY5 



SKY6 



not screened 
not screened 
not screened 

not screened 

not screened 

75F05, 79B14, 102024, 322K23, 
417D23, 600DU, 612E10, 725112, 
863108, 903M02, 1 125H21 

376816,544011, 544M21 
79P12, 295P22, 376120, 828024, 
886111,910006 



831322 

83D22, 114AU, 157G08, 966C15 
114 All, 168E21, 271D03, 635F21, 
765H16, 806O15, 904E13, 966C15 
301P22, 1079J08, 1078C20, 
I160A12 

148J07, 1136A14, 1160A12, 

1196123 

1149H11 



56A05,85D24,958M03 
829HB8 



not screened 
notsoeened 



75F05,322K23,612E10 
174124, 271E18, 295P22, 588E18, 
620X20, 632FU, 684H19, 705019 
174124, 271E18, 295P22, 588E18, 
620120, 632F11, 684H19, 7&5019 

* 14A3C is a hybridization probe previously described by Tyler-Smith et aL 1993. It detects a Y-specifiC 
Huufln-fragment of 3.5 kb and an additional autosomal fragment. 
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Table 7B Primer pairs for Y copy of Adtican 



rnmersequence (5 — > 3 J 


Direction with respect to putative 
transcription orientation 


primer 


GACTCCTGGCPTTfiA PTTOA 


forward 




AdIYExl 


TCTCTGTGGTGCTGATCCTG 


forward 




cfl 


GGAGGAGCAAAAACAAGAAGAGA 


forward 




cfl-117 


ACTGATGAGCACGGGAACC 


forward 




cfl -205 

vll ^bUJ 


TCCATCCTGAAAGTGCCTO 


forward 




C17c 


ACATGTATACATGCTGCCAA 


forward 




C18 


CAGCGAAGGAAAGCACATTT 


forward 




AdIYEx5 


GGCGACCTGAAGGGGACT 


forward 






CTGTCCAGTCCTCAGGAAGC 


forward 




£21 


GAAGCATCCACCAAAGCG 


forward 




cfl-4679 


ACAGCGGGCGCTATGAGT 


forward 




cfl -4a 








CAGGATCAGCACCACAGAGA 


reverse 




AdIYEx2 


CTGGGGAAGTTGGATTTTCTC 


reverse 




C17b 


ACXIAGGTTCCOGTGCTCA 


reverse 




cfl -227 


GCAAGAATCTGGGCTCTCAC 


reverse 




cfl 


ACTGTGATTCCCACCGTGAT 


reverse 




C17c 


TTGTTTTGAGGAACGCCTCT 


reverse 




C18 


GGATGTGGGATCTGGTGAG 


reverse 




cfl -2079 


GGGTGTAATTTTCTCCCATTG 


reverse 




AdlYEx5 i 


CGTCCGTTTCAGCAGTGACA 


reverse 




cfl -4810 


CTGACGTCCGTCCTCTGC 


reverse 




cfl -4b 


ATGGACAGTGATCCGGTTTC 


reverse 




cfl-6453 


TGAGCTGCACGATCAACCTC 


reverse | 


cfl-6559 
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Claims 

1. An isolated region of the Y chromosome between SKY1 and sY83 which 
encompasses the Y-specific growth gene GCY. 

2. An isolated region according to claim 1 which is about 700 Kb in size. 

3. An isolated region according to claim 1 or 2 in which the Y chromosome is a 
human chromosome. 

4. An isolated region according to claims 1 or 3 which is between SKY8 and sY83. 

5. An isolated region according to claims 1 or 3, which is between sY79 and sY8L 

6. An isolated GCY protein, encoded by a region of the Y chromosome within the 
interval SKY1 and sY83. 

7. An isolated GCY protein according to claim 6 encoded by a region within the 
interval SKY8 and sY83. 

8. An isolated GCY protein according to claim 7 encoded by a region between sY79 
andsY81. 

9. An isolated GCY protein according to claim 8, which is ADLY or a functional 
fragment thereof 

10. A nucleic acid primer having a nucleic acid sequence selected from a nucleic acid 
sequence as shown in Tables 2, 5, 6, 7A, 7B, 7C or 8. 

11. A method of studying GCY localisation or identifying a GCY gene associated with 
height comprising the use of a primer according to claim 1Q to selectively amplify or detect 
a region of a nucleic acid molecule. 

12. An isolated protein having greater than 65% homology to the GCY protein of 
claims 6-9, and which contributes to the sex-related height difference in humans. 
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13. Use of a nucleic acid molecule comprising at least a portion of the isolated region 
of the Y chromosome between markers SKY8 and sY83, or a sequence complementary 
thereto, to identify the presence or absence of a GCY gene associated with height 
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Figure 4 
GCY region AZFa region 
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<150> 


GB0215188.4 


<151> 


2002-07-01 


<160> 


167 


<170> 


Patentln version 3.2 


S_ ±. \J 


i 
J. 


<211> 


20 


<212> 


DNA 


<213> 


Artificial 


<220> 




<223> 


Primer: SKY1 left 


<400> 


1 


ggacatttgg ctgcagagat 


20 


<210> 


2 


<211> 


20 


<212> 


DNA 



<213> Artificial 
<220> 

<223> Primer: SKY1 right 
<400> 2 

tggcaatgca ctctcatcat 
20 



<210> 3 

<211> 20 

<212> DNA 

<213> Artificial 



Pano 1 



<220> 

<223> Primer: SKY2 left 
<400> 3 

tcaggacaga caggctgcta 
20 



<210> 4 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer: SKY2 right 
<400> 4 

cctgccactg agctccttac 
20 



<210> 5 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer: SKY3 left 
<400> 5 

ttctccctca tcttccaagc 
20 



<210> 6 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer: SKY3 right 
<400> 6 

gcttccatcc attagcaagg 
20 



<210> 7 

<211> 22 

<212> DNA 

<213> Artificial 



<220> 

<223> Primer: SKY4 right 
<400> 7 

cctttcattc cattctcttc ca 
22 



<210> 8 

<211> 19 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer: SKY4 right 
<400> 8 

cgcactttat ggactgcaa 
19 



<210> 9 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer: SKY5 left 
<400> 9 

ccctcgtcca tttcttttga 
20 



<210> 10 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer: SKY5 right 
<400> 10 

cctcgaattt aatggattgc 
20 



<210> 11 

<211> 21 

<212> DNA 

<213> Artificial 



<220> 

<223> Primer: SKY 6 left 
<400> 11 

tcaatggatg cacagtgtgg c 
21 



<210> 12 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer: SKY6 right 
<400> 12 

tccactgaat tccattgcac 
20 



<210> 13 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer: SKY7 left 
<400> 13 

gggagtgcaa agggaaagat 
20 



<210> 14 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer: SKY7 right 
<400> 14 

ctttccatgg ggtgacattc 
20 



<210> 15 

<211> 22 

<212> DNA 

<213> Artificial 



<220> 

<223> Primer: SKY8 left 



<400> 15 

ccattcattc gagttcatta eg 
22 



<210> 16 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer: SKY8 right 
<400> 16 

attggaatgg aatcggacag 
20 



<210> 17 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer: SKY9 left 
<400> 17 

ggccgatggt caaactgtta 
20 



<210> 18 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer: SKY9 right 
<400> 18 

gaaaeggget ctgaaattct 
•20 



<210> 19 

<211> 20 

<212> DNA 

<213> Artificial 
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<220> 

<223> Primer: SKY10 left 



<400> 19 

ataaggggca ggtttgtcac 
20 



<210> 20 

<211> 27 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer: SKY10 right 
<400> 20 

gctacttatt cagtgtttaa ctgacac 
27 



<210> 21 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer: SKY11 left 
<400> 21 

aaagtgggtg aaggacatgg 
20 



<210> 22 

<211> 19 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer: SKY11 right 
<400> 22 

tttttgtttg tggcaggtg 
19 



<210> 23 

<211> 21 

<212> DNA 

<213> Artificial 
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<220> 

<223> Primer: SKY12 left 
<400> 23 

ttgagtcact ggggataact g 
21 



<210> 24 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer: SKY12 right 
<400> 24 

tatggcccac aatcacttca 
20 



<210> 25 

<211> 22 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer: SKY13 left 
<400> 25 

ggcagcctag aaagtcttgt tc 
22 



<210> 26 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer: SKY13 right 
<400> 26 

cccttgggat tttgtctgtt 
20 



<210> 27 

<211> 20 

<212> DNA 

<213> Artificial 



<220> 

<223> Primer: A72 forward 
<400> 27 

atttccaccg aaacccattt 
20 



<210> 28 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer: A72 reverse 
<400> 28 

ctcccctacc accaacacac 
20 



<210> 29 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer: A92 forward 
<400> 29 

agggccctca catgattaaa 
20 



<210> 30 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer: A92 reverse 
<400> 30 

gcgacaccat ttctttccat 
20 



<210> 31 

<211> 20 

<212> DNA 

<213> Artificial 
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<220> 

<223> Primer: Alll forward 
<400> 31 

gacatcgtgg tgtctgttgc 
20 



<210> 32 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer: Alll reverse 
<400> 32 

cagacgttgt tcaggtcgtg 
20 



<210> 33 

<211> 19 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer: A134 forward 
<400> 33 

gcaccattag tgcgcttgt 
19 



<210> 34 

<211> 21 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer: A134 reverse 
<400> 34 

ttctcccttt accccaaatt c 
21 



<210> 35 

<211> 20 

<212> DNA 

<213> Artificial 



Paae 9 



<220> 

<223> Primer: A158 forward 



<400> 35 

ccagcaggag tcttggagtc 
20 



<210> 36 

<211> 21 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer: A158 reverse 
<400> 36 

tgagaggcac ctacggttag a 
21 



<210> 37 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer: B17 forward 
<400> 37 

ccaagcatgc cttcctaaag 
20 



<210> 38 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer: B17 reverse 
<400> 38 

tgccttctca tctgcttgtg 
20 



<210> 39 

<211> 20 

<212> DNA 

<213> Artificial 
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<220> 

<223> Primer: B37 forward 
<400> 39 

atcctgggag atgcatcaga 
20 



<210> 40 

<211> 24 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer: B37 reverse 
<400> 40 

tgagtcctaa accgtacaca taca 
24 



<210> 41 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer: B59 forward 
<400> 41 

caatggaaat gttgcag.gtg 
20 



<210> 42 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer: B59 reverse 
<400> 42 

tcctgccctg ctgttagagt 
20 



<210> 43 

<211> 20 

<212> DNA 

<213> Artificial 



Pace 11 



<220> 

<223> Primer: B82 forward 
<400> 43 

gcaagggtgt tgcaagttta 
20 



<210> 44 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer: B82 reverse 
<400> 44 

tgcatattgt ccacacatgg 
20 



<210> 45 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer: B102 forward 
<400> 45 

aaagagaagg gccctgtgat 
20 



<210> 46 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer: B102 reverse 
<400> 46 

ctaggcaaca gcactggaaa 
20 



<210> 47 

<211> 19 

<212> DNA 

<213> Artificial 
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<220> 

<223> Primer: C17 forward 
<400> 47 

aaaatccact tccccagtg 
19 



<210> 48 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer: C17 reverse 
<400> 48 " 

gcaagaatct gggctctcac 
20 



<210> 49 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer: C37 forward 
<400> 49 

cactggggaa ggctgtgata 
20 



<210> 50 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer: C37 reverse 
<400> 50 

cattgtcatc actgccaggt 
20 



<210> 51 

<211> 21 

<212> DNA 

<213> Artificial 



Paqe 13 



<220> 

<223> Primer: C56 forward 



<400> 51 

cccacttctt ctccaaagtc c 
21 



<210> 52 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer: C56 reverse 
<400> 52 

gcacccgttt tcctgatcta 
20 



<210> 53 

<211> 21 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer: C77 forward 
<400> 53 

ggggcatatt ctacacacca a 
21 



<210> 54 

<211> 21 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer: C77 reverse 
<400> 54 

tgaaatggca aacctttcag a 
21 



<210> 55 

<211> 22 

<212> DNA 

<213> Artificial 



Pace 14 



<220> 

<223> Primer: C97 forward 
<400> 55 

aagaatggaa ggatctccaa ga 
22 



<210> 56 

<211> 22 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer: C97 reverse 
<400> 56 

tctgtgcaga aatgatggat tc 
22 



<210> 57 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer: C120 forward. 
<400> 57 

tggtagtggg aaactgctca 
20 



<210> 58 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer: C120 reverse 
<400> 58 

tggtgtgcta agtggctgtc 
20 



<210> 59 

<211> 23 

<212> DNA 

<213> Artificial 
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<220> 

<223> Primer: C142 forward 
<400> 59 

gctgcagtta gctaaaccaa gac 
23 



<210> 60 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer: C142 reverse 
<400> 60 

attctgcctg aacctccaga 
20 



<210> 61 

<211> 61 

<212> DNA 

<213> Artificial 

<220> 

<223> Exon trap clone: et_a_001 reverse 
<400> 61 

ggtctttggc tcaactcagg ttccctctac ctgaaatgat ccaccttcag agaattggat 
60 

g 

61 



<210> 62 

<211> 51 

<212> DNA 

<213> Artificial 

<220> 

<223> Exon trap clone: et_a_002 exon 1 reverse 
<400> 62 

ctgtcttgcc tcctcgatgg gaaaagaaac aagcgcacta atggtgcatt t 
51 



<210> 63 
<211> 131 



Paae 16 



<212> DNA 

<213> Artificial 

<220> 

<223> Exon trap clone: et_a_002 
<400> 63 

ctggagcatc aggggtgtct tctatgatca 
60 

gcagacttct gcttggtcac tctgatagct 
120 

atggggaaac t 
131 



<210> 64 

<211> 44 

<212> DNA 

<213> Artificial 

<220> 

<223> Exon trap clone: et_a_003 
<400> 64 

cttttacata gaatggtaac tccttttgca 
44 



<210> 65 

<211> 170 

<212> DNA 

<213> Artificial 

<220> 

<223> Exon trap clone: et_a_004 
<400> 65 

aaagttggta gttcgctccc gggctgatgc 
60 

acatcctgca gccacacggg aggtggctcc 
120 

aggggacacc gggcacagct tgaagcttgg 
170 



<210> 66 
<211> 93 
<212> DNA 



exon 2 reverse 

aggaaggaag ccactcaggg tgatagagct 
ctgggaacac tgtgcacctc tctggctgtg 



forward 

cctcgtgttt tttc 
reverse 

tcagagtgtg gaacttgagg agctgcggtg 
tcaggggcga ttgctggctg tgtcaccacc 
ggacagggag ctgagaggac 
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<213> Artificial 
<220> 

<223> Exon trap clone: et_c_001 forward 
<400> 66 

qattacatgg actactatat ttaaaattcc ttctaaactt tttcccattt ctgctcaatt 
60 

ttcattctcc aatatttgca aaacttaaag ttc 
93 



<210> 67 

<211> 60 

<212> DNA. 

<213> Artificial 

<220> 

<223> Exon trap clone: et_c_002 forward 
<400> 67 

gctgaacatt atttctttat tccagattag aggactagga. ttcatgggat tatgcatcaa 
60 



<210> 68 

<211> 68 

<212> DNA 

<213> Artificial 

<220> 

<223> Exon trap clone: et_c_003 reverse 
<400> 68 

ggaaatcttg aaatggcaaa cctttcagaa gagatggcag agactctcct acatattctg 
60 

ttctcaat 
68 



<210> 69 

<211> 92 

<212> DNA 

<213> Artificial 

<220> 

<223> Exon trap clone: et_c_004 exon 1 reverse 

<400> 69 

acactggaag aattggtgtc taggcagtct gggataatag cctagttcta aggacattat 
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60 



cattgatccc tttataggcc atagacctcc at 
92 



<210> 70 

<211> 96 

<212> DNA 

<213> Artificial 

<220> 

<223> Exon trap clone: et_c_004 exon 2 reverse 
<400> 70 

ttcttcctgt tggtgcagga gggtgattaa gggcttttcc taccttaagt tgatcaaa 
60 

ggtattttca taagattaat ctggcagcag aatgca 
96 



<210> 71 

<211> 79 

<212> DNA 

<213> Artificial 

<220> 

<223> Exon trap clone: et_c_005 forward 
<400>- 71 

cttggttggg aaaatatggc caccatattg ctgggaaagc caccaagagt ggactgtta 
60 

caatatccaa gggacatga 
7 9 



<210> 72 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer: a_r_001 forward 
<400> 72 

gcttggaact tgaggtgctc 
20 



<210> 73 
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<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer: a_r_001 reverse 
<400> 73 

ggagatgtgg gcttgtgagt 
20 



<210> 74 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer: a_r_002 forward 
<400> 74 

ctgtgggtgc attaggtgtg 
20 



<210> 75 

<211> 19 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer: a_r_002 reverse 
<400> 75 

ctggtacatg ctgcctgct 
19 



<210> 76 

<211> 23 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer: b_f_001 forward 
<400> 76 

gacctctttt gagaaagtca gca 
2 3 



<210> 77 
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<211> 
<212> 
<213> 



20 
DNA 

Artificial 



<220> 

<223> Primer: b_f_001 reverse 
<400> 77 

aaagcaatgg caacaaaagc 
20 



<210> 78 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer: b_r_001 forward 
<400> 78 

agagggagga aagagccatc 
20 



<210> 79 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer: b_r_001 reverse 
<400> 79 

gttgtacggg ctgcagaatc 
20 



<210> 80 

<211> 24 

<212> DNA 

<213> Artificial 



<220> 

<223> Primer: b_r_002 forward 
<400> 80 

tgagtcctaa accgtacaca taca 
24 



<210> 81 
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<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer: b_r_002 reverse 
<400> 81 

tttctgtgcg tgagaacaca 
20 



<210> 82 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer: c_f_001 forward 
<400> 82 

tctctgtggt gctgatcctg 
20 



<210> 83 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer: c_f_001 reverse 
<400> 83 

gcaagaatct gggctctcac 
20 



<210> 84 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer: c_f_001b forward 

<400> 84 

atccctattc gccccttaga 
20 



<210> 85 
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<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer: c_f_002 forward 
<400> 85 

acctcagggt gcagctttta 
20 



<210> 86 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer: c_f_002 reverse 
<400> 86 

tgagcagttt cccactacca 
20 



<210> 87 

<211> 23 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer: c_f_003 forward 
<400> 87 

gctgcagtta gctaaaccaa gac 
23 . . 



<210> 88 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer: c_f_003 reverse 
<400> 88 

ttctgcaagg gtctggttct 
2 0 



<210> 89 



Paae 23 



<211> 18 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer: c_r_001 forward 
<400> 89 

cacagaagcc agggatcg 
18 



<210> 90 

<211> 18 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer: c_r_001 reverse 
<400> 90 

gcatctcgcc ctttcctc 
18 



<210> 91 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer: c_r_002 forward 
<400> 91 

caacactgta caccgcaaca 
20 



<210> 92 

<211> 21 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer: c_r_002 reverse 
<400> 92 

ttctccaaag tccgatacct g 

21 



<210> 93 



Dano 94 



<211> 21 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer: c_r_003 forward 
<400> 93 

tggagacatt cacaacgtca a 
21 



<210> 94 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer: c_r_003 reverse 
<400> 94 

tggtagtggg aaactgctca 
20 



<210> 95 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer: c_r_004 forward 
<400> 95 

agctgcctga cttcttggaa 
20 



<210> 96 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer: c_r_004 reverse 
<400> 96 

cttgcccaca ccttgatctc 
20 



<210> 97 



Panp. ?F> 



<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer: c_r_005 forward 
<400> 97 

cgtgctggat tcctatttgg 
20 



<210> 98 

<211> 21 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer: c_r_005 reverse 
<400> 98 

cccacttctt ctccaaagtc c 
21 



<210> 99 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer: AdlYExl forward 
<400> 99 

gactcctggc cttgacttga 
20 



<210> 100 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer: cfl forward 
<400> 100 

tctctgtggt gctgatcctg 
20 



<210> 101 



Paae 26 



<211> 23 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer: cfl-117 forward 
<400> 101 

ggaggagcaa aaacaagaag aga 
23 



<210> 102 

<211> 19 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer: cfl-205 forward 
<400> 102 

actgatgagc acgggaacc 
19 



<210> 103 

<211> 19 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer: C17c forward 
<400> 103 

tccatcctga aagtgcctg 
19 



<210> 104 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer: C18 forward 
<400> 104 

acatgtatac atgctgccaa 
20 



<210> 105 



Paae 27 



<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer: AdlYEx5 forward 
<400> 105 

cagcgaagga aagcacattt 
20 



<210> 106 

<211> 18 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer: cf 1-1815 forward 

<400> 106 
ggcgacctga aggggact 
18 



<210> 107 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer: C21 forward 
<400> 107 

ctgtccagtc ctcaggaagc 
20 



<210> 108 

<211> 18 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer: cfl-4679 forward 

<400> 108 
gaagcatcca ccaaagcg 
18 



<210> 109 
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<211> 18 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer: cfl-4a forward 

<400> 109 
acagcgggcg ctatgagt 
18 



<210> 110 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer: AdlYEx2 reverse 
<400> 110 

caggatcagc accacagaga 
20 



<210> 111 

<211> 21 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer: C17b reverse 
<400> 111 

ctggggaagt tggattttct c 
21 



<210> 112 

<211> 18 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer: cfl-227 reverse 

<400> 112 
accaggttcc cgtgctca 
18 



<210> 113 



<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer: cfl reverse 
<400> 113 

gcaagaatct gggctctcac 
20 



<210> 114 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer: C17c reverse 
<400> 114 

actgtgattc ccaccgtgat 
20 



<210> 115 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer: C18 reverse 
<400> 115 

ttgttttgag gaacgcctct 
20 



<210> 116 

<211> 19 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer: cfl-2079 reverse 
<400> 116 

ggatgtggga tctggtgag 
19 



<210> 117 



Paae 30 



<211> 21 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer: AdlYEx5 reverse 
<400> 117 

gggtgtaatt ttctcccatt g 
21 



<210> 118 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer: cfl-4810 reverse 
<400> 118 

cgtccgtttc agcagtgaca 
20 



<210> 119 

<211> 18 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer: cfl-4b reverse 

<400> 119 
ctgacgtccg tcctctgc 
18 



<210> 120 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer: cfl-6453 reverse 
<400> 120 

atggacagtg atccggtttc 
20 



<210> 121 



Paae 31 



<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer: cfl-6559 reverse 
<400> 121 

tgagctgcac gatcaacctc 
20 



<210> 122 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> RT-PCR Primer: AdlYExl forward 
<400> 122 

gactcctggc cttgacttga 
20 



<210> 123 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> RT-PCR Primer: cfl forward 
<400> 123 

tctctgtggt gctgatcctg 
20 



<210> 124 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> RT-PCR Primer: AdlYEx5 forward 
<400> 124 

cagcgaagga aagcacattt 
20 



<210> 125 



Paap 32 



<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> RT-PCR Primer: C21 forward 
<400> 125 

ctgtccagtc ctcaggaagc 
20 



<210> 126 

<211> 18 

<212> DNA 

<213> Artificial 

<220> 

<223> cfl-4a forward 

<400> 126 
acagcgggcg ctatgagt 
18 



<210> 127 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> RT-PCR Primer: AdlYEx2 reverse 
<400> 127 

caggatcagc accacagaga 
20 



<210> 128 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> RT-PCR Primer: cfl reverse 
<400> 128 

gcaagaatct gggctctcac 
20 



<210> 129 



<211> 21 

<212> DNA 

<213> Artificial 

<220> 

<223> RT-PCR Primer: AdlYExS reverse 
<400> 129 

gggtgtaatt ttctcccatt g 
21 



<210> 130 

<211> 18 

<212> DNA 

<213> Artificial 

<220> 

<223> RT-PCR Primer: cfl-4b reverse 

<400> 130 
ctgacgtccg tcctctgc 
18 



<210> 131 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> RT-PCR Primer: cfl-6452 reverse 
<400> 131 

atggacagtg atccggtttc 
20 



<210> 132 

<211> 19 

<212> DNA 

<213> Artificial 

<220> 

<223> RT-PCR primer for exon trap clone: eta2 forward 
<400> 132 

gcaccattag tgcgcttgt 
19 



<210> 133 



Paae 34 



<211>< 

<212> 

<213> 



20 
DNA 

Artificial 



<220> 

<223> RT-PCR primer for exon 
<400> 133 

gagcatcagg ggtgtcttct 
20 



<210> 134 

<211> 26 

<212> DNA 

<213> Artificial 

<220> 

<223> RT-PCR primer for exon 
<400> 134 

ttacatagaa tggtaactcc ttttgc 
26 



<210> 135 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> RT-PCR primer for exon 
<400> 135 

aactcctttt gcacctcgtg 
20 



<210> 136 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> RT-PCR primer for exon 
<400> 136 

gctgatgctc agagtgtgga 
20 



<210> 137 



trap clone: eta2 reverse 



trap clone: eta3a forward 



trap clone: eta3b forward 



trap clone: eta4a reverse 



Pane 



<211> 19 

<212> DNA 

<213> Artificial 

<22?> RT-PCR primer for exon trap clone: eta4b reverse 
<400> 137 

gattgctggc tgtgtcacc 
19 



<210> 138 
<211> 26 
<212> DNA 
<213> Artificial 

<22?> RT-PCR primer for exon trap clone: etcla forward 
<400> 138 

tttaaaattc cttctaaact ttttcc 
26 



<210> 139 
<211> 21 
<212> DNA 
<213> Artificial 

<223> RT-PCR primer for exon trap clone: etclb forward 
<400> 139 

cccatttctg ctcaattttc a 
21 



<210> 140 
<211> 26 
<212> DNA 
<213> Artificial 

<223> RT-PCR primer for exon trap clone: etc2a forward 
<400> 140 

gctgaacatt atttctttat tccaga 
26 

<210> 141 



Paae 36 



<211> 23 

<212> DNA 

<2I3> Artificial 

<220> 

<223> RT-PCR primer for exon 
<400> 141 

agaggactag gattcatggg att 
23 



<210> 142 

<211> 21 

<212> DNA 

<213> Artificial 

<220> 

<223> RT-PCR primer for exon 
<400> 142 

tgaaatggca aacctttcag a 
21 



<210> 143 

<211> 23 

<212> DNA 

<213> Artificial 

<220> 

<223> RT-PCR primer for exon 
<400> 143 

ggcagagact ctcctacata ttc 
23 



<210> 144 

<211> 21 

<212> DNA 

<213> Artificial 

<220> 

<223> RT-PCR primer for exon 
<400> 144 

tggcctataa agggatcaat g 
21 



<210> 145 



trap clone: etc2b forward 



trap clone: etc3a reverse 



trap clone: etc3b reverse 



trap clone: etc4 forward 



Paoe 37 



<211> 20 

<212> DNA 

<213> Artificial 

<223> RT-PCR primer for exon trap clone: etc4 reverse 
<400> 145 

ggtgcaggag ggtgattaag 
20 



<210> 146 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 



<223> RT-PCR primer for exon trap clone: etcSa forward 



<400> 146 

gaaagccacc aagagtggac 
20 



<210> 147 

<211> 21 

<212> DNA 

<213> Artificial 

<220> 



<223> RT-PCR primer for exon trap clone: etc5b forward 



<400> 147 

accaatatcc aagggacatg a 
21 



<210> 148 

<211> 10 

<212> DNA 

<213> Artificial 

<220> 

<223> Exon 1: Intron/Exon ADLX boundary 

<400> 148 
gagctgcctc 
10 



<210> 149 



Page 38 



<211> 10 

<212> DNA 

<213> Artificial 

<220> 

<22 3> Exon 2: Intron/Exon ADLX boundary 

<400> 149 
gagctgcctc 
10 



<210> 150 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Exon 2: Intron/Exon ADLX boundary 
<400> 150 

tctacctcag gtatccgaga 
20 



<210> 151 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Exon 2: Intron/Exon ADLY boundary 
<400> 151 

tctacctcag gtatccgaga 
20 



<210> 152 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Exon 5: Intron/Exon ADLX boundary 
<400> 152 

tttgttttag gaattctgaa 
20 



<210> 153 



Paqe 39 



<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Exon 5: Intron/Exon ADLY boundary 
<400> 153 

tttgttttag gaattctgaa 
20 



<210> 154 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Exon 6: Intron/Exon ADLX boundary 
<400> 154 

ttttctccag gagctcttat 
20 



<210> 155 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Exon 6: Intron/Exon ADLY boundary 
<400> 155 

ttttctccag gagttcttat 
20 



<210> 156 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Exon 7: Intron/Exon ADLX boundary 
<400> 156 

ttttctgtag ttttgatagc 
20 



<210> 157 



Pane 6 D 



<211> 20 

<212> DNA 

<2i3> Artificial 



<223> Exon 7: Intron/Exon ADLY boundary 
<400> 157 

ttttctgtag ttttgatagt 
20 



<210> 158 

<211> 20 

<212> DNA 

<213> Artificial 



<223> Exon 1: Exon/Intron ADLX boundary 
<400> 158 

ccaaggacag gtgaggaccc 
20 



<210> 159 
<211> 20 
<212> DNA 
<213> Artificial 

<223> Exon 1: Exon/Intron ADLY boundary 
<400> 159 

ccaaggatag gtgaggaccc 
20 



<210> 160 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Exon 2: Exon/Intron ADLX boundary 
<400> 160 

tcaatttggg gtttgtacca 
20 



<210> 161 



Page 41 



<211> 20 

<212> DNA 

<2i3> Artificial 

<223> Exon 2: Exon/Intron ADLY boundary 
<400> 161 

tcaatttggg gtttgtacca 
20 



<210> 162 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Exon 5: Exon/Intron ADLX boundary 
<400> 162 

gtttccacag gtaatatgtt 
20 



<210> 163 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Exon 5: Exon/Intron ADLY boundary 
<400> 163 

gtttccacat gtaagatttt 
20 



<210> 164 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Exon 6: Exon/Intron ADLX boundary 
<400> 164 

cgctcttcag gtaggcagct 
2 0 



<210> 165 
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<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Exon 6: Exon/Intron ADLY boundary 
<400> 165 

cgcttttcag gtaggcagct 
20 



<210> . 166 

<211> 11 

<212> DNA 

<213> Artificial 

<220> 

<223> Exon 7: Exon/Intron ADLX boundary 

<400> 166 
atattctccc c 
11 



<210> 167 

<211> 11 

<212> DNA 

<213> Artificial 



<223> Exon 7: Exon/Intron ADLY boundary 

<400> 167 
atattctccc c 
11 
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